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AACTCCTTCTACACCCGAAAGGAATCCTATGACTATCAGGCAGCCTGTGCCCCTCAGCCTCCTGGGAACC 
TGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCACCATTGCAGATTTCCTTAACTTGGCCTGGTGGACCTC 
TGCTGCCGCCTGGTCCTTATTCCAGCAGCTACTCTATGGTCTCATCTATCACAGCTGGTTCCAGGCAGAC 
CCGGCAGAAGCTGAGGGCAGCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACT 
TCGGCTCGGTGAACGCATCCTATAATGCCATCATTGACTGCGGAAACTGCAGCAGGCTGTTCCATGCGCA 
GAGACTGACCAACACCAATCTCCTGTTCGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGTGCGAGGCGGGC 
CGGCTGCTGCAGAAGGAGACACACTCGGACGGCCCGGAGCAGTGTGAGCTGGTGCAGAGACCGAGATACC 
GAAGAGGTCCGCACATCTGTTTTGACTACAATGCGACGGAAGATACCTCAGACTGTGGCCGCGGAGCCTC 
CTTCCCTCCGTCGCTGGGCGTCTTGGTTTCCTTGCAGCTTTTGCTCCTCCTGGGCCTGCCACCTCGGCCG 
CAGCCTCAAGTCCACTCCTTCGCTGCCTCTCGCCACCTCTGAGCGACCCACACACACACATCATACCCCC 
GCCTTGGCCTCCTAGCCTTTCGCTCACCCTCCCATTCCACATTCCCCAATCTAGAGCCTTGGCCACTCTC 
TCCTGAAGGACCTGGGTCCCTTCCCCCGGAGCCTGTGCCTTGGGGCAGGGGAACCCCAAAGTAAGGTGCC 
ATGGTGTTTGGCACTCAAGATTTAGCTCACCCTTGAACTGTCCAAGTGCCCGCAGTCCCTAGACTCATCC 
CCGTGGGCTAGGACAGGAGGCCACTAGTACTGATGCCAAACCAGGCCTCCACCGACCCACCTGCCTGGAG 
ATTTCCTCTATGTAGGCAACCCTGCCACTGCTGGGCACCTCTAACTGGCCCTTTGGCCCCACCCAAGCCC 
AAACTTACCTTCTCTGGGGGAAAAAAAAAGGAAAGATGGTAATAGTGAGAGATTCGGGGGGCACCCCCTC 
CCCATTGGTTTCTGGCCCTTTCAGGCTACAACCCCCCAGCCTTGCAGGTGTCAGAACAGTCTCACAATGA 
CATCAGTTTAGACACATGCCATATACACTTGGATCTCTGAGAGCAGAAACCCAACTCTCACTAGACATAC 
CTGTGATGGAACACACAAACAGACACGCACCATGGGGGGTGGCCCACAAAGCCTTACACAAGGCGAGATG 
TCAATGAAGGGGTTGGCCTGTGTGTTCCATCTCTGCTCACCTCTGCCTCTACTCTGAGATGCAGCCTGGC 
TGATCCTCCCATCTCTAAAACTGAATGTCAAACCGTGCCAAATGCTGGGGGGGGGGGGAGACCTCTCTGT 
TTCACCCCTAGCCACCAGTGTCCCCAAGTGCCCCTCACCCTGCCAGGTGCTCATTGTAACCATCGTTCAC 
CAGTGTCCGGCCCCTAGTAGGACCACACATCACTGCCTGAACTCCTTTGGCAGAAGAACCCCACCAGACA 
TTGAGACATTGTATTTTGCCTTAGCAGGGATGAGTTGGTCTCTCCTGGCTGGGCCATCCCATCCCCAATC 
TGGTTCTTGCACACTCAGGCCTAATTCCCTCTGCACACACACACACACACACACACACACACACACACAC 
ACACACACACAGTCCCTGCCCCTAGGAGGCCAAATTACCCCTCCCTTGCTGAACACACCCTTGCACCATG 
CACATGTCTAACCAACCGTACTGCACACACAGAGGCTGGACCTGGGACACATCTCTTTACACCTTTCATT 
CTGTCATTTCTCCCAAAGGCATCGTAACTTGGGGGCCAGGAGGGGACTGAGGGGCAGGGGGGAGGGGTGT 
AGCTGTGAGGCTCAGATGGACTGGGAGGAGGGGGGAGGGTGATACATTAATTAATGGCTTCGTTAATTAA 
TGTCATGTTGCTTGTTGCTTTCTCAGTGTGTGTATGGTCCATGCCCAGTGCTGGTGACAGGGTGGGTATC 
CATGATGTGTGCCCAGCCTGGATGTCAGCTGTGTCCTGTGGGGGCGTGTGTGTAACTGTAGTGTAGTCAG 
GTGCTCAACGGAGAATATAAACAAAAAAAAAAGAAACAAACGTATACAGAAAAATAAATGTATATTTT^ 
GTTTAAAGACAAATGAAACCAGACAAAACAATCCCCATCAGGTAGTTGTCCAACCCCCAGCTGGGTTCAA 
CCCTCTCATTACCCACCTGACCTAGCTGTCCCCTTACTGTGGGCTGGGGGACTTGGGGGCCATTTCCTTT 
GCCCTTTTTTTTTGTTGTTATTCTATTTTGTACAGACAAGTTGGGAAAACAACAGCGACAAAAAAAAGTC 
GAGAAACTTTGTAAAATATTGTGTGTGTGATTCCTTGTAAAATATTTTCAAATGGTTTATTACAGAAGAT 
CAGTTATTAAATAATGTTCATATTTTCACTTC (SEQ ID N0:1) 
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NSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTIADFLNLAWWTSAAAWSLFQQLLYGLIYH 
SWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAI IDCGNCSRLFHAQRLTNTNLLFW 
AEKPLCSQCEAGRLLQKETHSDGPEQCELVQRPRYRRGPHICFDYNATEDTSDCGRGASFPPSL 
GVLVSLQLLLLLGLPPRPQPQVHSFAASRHL (SEQ ID NO: 2) 
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underlined = deleted in targeting construct 

BOLD = sequence flanking Neo insert in targeting construct 



AACTCCTTCTACACCCGAAAGGAATCCTATGACTATCAGGCAGCCTGTGCCCCTCAGCCT 
CCTGGGAACCTGGGTGCTGCA CC C C GGGGTGTC TTTGTGC C C AC C ATTGC AGATTTC C TT 
AACTTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCTTATT CCAGCAGCTACTCTATGGT 
CTCATCTATCACAGCTGGTTCCAGGCAGAC C CGGC AGAAGC TGAGGGC AGC C C C GAGAC G 
CGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCTCGGTGAACGCATCC 
TATAATGCCATCATTGACTGCGGAAACTGCAGCAGGCTGTTCCATGCGCAGAGACTGACC 
AACACCAATCTCCTGTTCGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGTGCGAGGCGGGC 
C GGC TGC TGCAGAAGGAGAC AC AC TC GGAC GGC C CGGAGC AGTGTGAGCTGGTGC AGAGA 
C C GAGATAC C GAAGAGGTC CGC ACATC TGTTTTGACTAC AATGCGACGGAAGATACCTCA 
GACTGTGGCCGCGGAGCCTCCTTCCCTCCGTCGCTGGGCGTCTTGGTTTCCTTGCAGCTT 
TTGCTCCTCCTGGGCCTGCCACCTCGGCCGCAGCCTCAAGTCCACTCCTTCGCTGCCTCT 
CGCCACCTCTGAGCGACCCACACACACACATCATACCCCCGCCTTGGCCTCCTAGCCTTT 
CGCTCACCCTCCCATTCCACATTCCCCAATCTAGAGCCTTGGCCACTCTCTCCTGAAGGA 
C C TGGGTC C C TTC C C C C GGAGC C TGTGCC TTGGGGC AGGGGAAC CC C AAAGT AAGGTGC C 
ATGGTGTTTGGCACTCAAGATTTAGCTCACCCTTGAACTGTCCAAGTGCCCGCAGTCCCT 
AGAC TC ATC CCCGTGGGC TAGGAC AGGAGGCC AC TAGTACTGATGCC AAAC CAGGC C TC C 
AC C GAC C C ACC TGC CTGGAGATTTC CTCTATGTAGGC AAC CC TGCCAC TGCTGGGC AC C T 
CTAACTGGCCCTTTGGCCCCACCCAAGCCCAAACTTACCTTCTCTGGGGGAAAAAAAAAG 
GAAAGATGGTAATAGTGAGAGATTC GGGGGGC ACCCCC TC CC CATTGGTTTC TGGCC CTT 
TCAGGCTACAACCCCCCAGCCTTGCAGGTGTCAGAACAGTCTCACAATGACATCAGTTTA 
GAC AC ATGCC ATATAC ACTTGGATC TC TGAGAGCAGAAAC CC AACTC TCAC TAGAC ATAC 
CTGTGATGGAACACACAAACAGACACGCACCATGGGGGGTGGCCCACAAAGCCTTACACA 
AGGCGAGATGTCAATGAAGGGGTTGGCCTGTGTGTTCCATCTCTGCTCACCTCTGCCTCT 
ACTCTGAGATGCAGCCTGGCTGATCCTCCCATCTCTAAAACTGAATGTCAAACCGTGCCA 
AATGCTGGGGGGGGGGGGAGACCTCTCTGTTTCACCCCTAGCCACCAGTGTCCCCAAGTG 
CCCCTCACCCTGCCAGGTGCTCATTGTAACCATCGTTCACCAGTGTCCGGCCCCTAGTAG 
GACC AC AC ATC AC TGCC TGAACTC CTTTGGC AGAAGAAC CC C ACC AGACATTGAGAC ATT 
GTATTTTGCCTTAGCAGGGATGAGTTGGTCTCTCCTGGCTGGGCCATCCCATCCCCAATC 
TGGTTCTTGCACACTCAGGCCTAATTCCCTCTGCACACACACACACACACACACACACAC 
AC AC AC AC AC AC AC AC AC AC AGTC CC TGC CC CTAGGAGGCCAAATTAC C C CTC C CTTGC T 
GAACACACCCTTGCACCATGCACATGTCTAACCAACCGTACTGCACACACAGAGGCTGGA 
CCTGGGACACATCTCTTTACACCTTTCATTCTGTCATTTCTCCCAAAGGCATCGTAACTT 
GGGGGCCAGGAGGGGACTGAGGGGCAGGGGGGAGGGGTGTAGCTGTGAGGCTCAGATGGA 
CTGGGAGGAGGGGGGAGGGTGATACATTAATTAATGGCTTCGTTAATTAATGTCATGTTG 
CTTGTTGCTTTCTCAGTGTGTGTATGGTCCATGCCCAGTGCTGGTGACAGGGTGGGTATC 
C ATGATGTGTGC C C AGC C TGGATGTC AGC TGTGTC C TGTGGGGGC GTGTGTGT AACTGTA 
GTGTAGTC AGGTGC TC AAC GGAGAATATAAACAAAAAAAAAAGAAAC AAAC GT ATAC AGA 
AAAATAAATGTATATTTTAAGTTTAAAGACAAATGAAACCAGACAAAACAATCCCCATCA 
GGTAGTTGTCCAACCCCCAGCTGGGTTCAACCCTCTCATTACCCACCTGACCTAGCTGTC 
CCCTTACTGTGGGCTGGGGGACTTGGGGGCCATTTCCTTTGCCCTTTTTTTTTGTTGTTA 
TTC TATTTTGTAC AGAC AAGTTGGGAAAAC AAC AGC GAC AAAAAAAAGTCGAGAAAC TTT 
GTAAAATATTGTGTGTGTGATTCCTTGTAAAATATTTTCAAATGGTTTATTACAGAAGAT 
C AGTTATTAAATAATGTTC ATATTTTC AC TTC 
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Gene Sequence Structure* 



Size of partial cDNA: 2552 bp 



83 bp 



Sequence Deleted 161 bp 




Targeting Vector* 
(genomic sequence) 



Arm Length: 
5': 3.5 kb 
3 f : 1.6 kb 



* Not drawn to scale 



LacZ-Neo 



5 1 >ACGGTGTAGGTCAGAGCCTTC I 
TAGAGCATGCCCTGTGACTATGGA ' 
TCTCTCCCTGCATCCCCAGGTTGG | 
CAGATTCTTCAGTGAGGTGGATGC j 
CAACCTGATGCTGGCACTGTACAA • 
TAACTCCTTCTACACCCGAAAGGA 1 
ATC CTATGACTATC AGGCAGCCTG ; 
TGCCCCTCAGCCTCCTGGGAACCT 1 
GGGTGCTGCAC<3 1 
(SEQ ID N0:3) i 



5 1 > C C AGC AGCT AC TCT ATGGTCT { 
CATC TATC AC AGC TGG TTCC AGGC j 
AGGTAAGTAGGGTTTAGGATGCTT 
GGCCCCAAATCTGTGTCCAGGGCG I 
GGAACAGATGCTCGGATCACAAGG I 
AGAGTGGGGC TTAGGGC TGC GCC A j 
AGCTGAGGCGGACGATTGTCTGTG 
GGCGGGGCTGAGGCGTCTGGGCCC j 
CGCAGACCCGGO' 
(SEQ ID NO: 4) 
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